Goto

Collaborating Authors

 Ontario


Human Experts' Evaluation of Generative AI for Contextualizing STEAM Education in the Global South

Nyaaba, Matthew, Nabang, Macharious, Kyeremeh, Patrick, Nantomah, Ibrahim, Owusu-Fordjour, Collins, Ako, Martin, Akanzire, Bismark Nyaaba, Nantomah, Kassim Korah, Issaka, Cecilia, Zhai, Xiaoming

arXiv.org Artificial Intelligence

STEAM education in many parts of the Global South remains abstract and weakly connected to learners sociocultural realities. This study examines how human experts evaluate the capacity of Generative AI (GenAI) to contextualize STEAM instruction in these settings. Using a convergent mixed-methods design grounded in human-centered and culturally responsive pedagogy, four STEAM education experts reviewed standardized Ghana NaCCA lesson plans and GenAI-generated lessons created with a customized Culturally Responsive Lesson Planner (CRLP). Quantitative data were collected with a validated 25-item Culturally Responsive Pedagogy Rubric assessing bias awareness, cultural representation, contextual relevance, linguistic responsiveness, and teacher agency. Qualitative reflections provided additional insight into the pedagogical and cultural dynamics of each lesson. Findings show that GenAI, especially through the CRLP, improved connections between abstract standards and learners lived experiences. Teacher Agency was the strongest domain, while Cultural Representation was the weakest. CRLP-generated lessons were rated as more culturally grounded and pedagogically engaging. However, GenAI struggled to represent Ghana's cultural diversity, often producing surface-level references, especially in Mathematics and Computing. Experts stressed the need for teacher mediation, community input, and culturally informed refinement of AI outputs. Future work should involve classroom trials, broader expert participation, and fine-tuning with Indigenous corpora.


Adaptive Latent-Space Constraints in Personalized Federated Learning

Ayromlou, Sana, Tavakoli, Fatemeh, Emerson, D. B.

arXiv.org Artificial Intelligence

Federated learning (FL) is an effective and widely used approach to training deep learning models on decentralized datasets held by distinct clients. FL also strengthens both security and privacy protections for training data. Common challenges associated with statistical heterogeneity between distributed datasets have spurred significant interest in personalized FL (pFL) methods, where models combine aspects of global learning with local modeling specific to each client's unique characteristics. This work investigates the efficacy of theoretically supported, adaptive MMD measures in pFL, primarily focusing on the Ditto framework, a state-of-the-art technique for distributed data heterogeneity. The use of such measures significantly improves model performance across a variety of tasks, especially those with pronounced feature heterogeneity. Additional experiments demonstrate that such measures are directly applicable to other pFL techniques and yield similar improvements across a number of datasets. Finally, the results motivate the use of constraints tailored to the various kinds of heterogeneity expected in FL systems.


Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

Raina, Kevin, Schmah, Tanya

arXiv.org Machine Learning

Out-of-Distribution (OOD) detection is critical to AI reliability and safety, yet in many practical settings, only a limited amount of training data is available. Bayesian Neural Networks (BNNs) are a promising class of model on which to base OOD detection, because they explicitly represent epistemic (i.e. model) uncertainty. In the small training data regime, BNNs are especially valuable because they can incorporate prior model information. We introduce a new family of Bayesian posthoc OOD scores based on expected logit vectors, and compare 5 Bayesian and 4 deterministic posthoc OOD scores. Experiments on MNIST and CIFAR-10 In-Distributions, with 5000 training samples or less, show that the Bayesian methods outperform corresponding deterministic methods.


AutoGrid AI: Deep Reinforcement Learning Framework for Autonomous Microgrid Management

Guo, Kenny, Eckhert, Nicholas, Chhajer, Krish, Abeykoon, Luthira, Schell, Lorne

arXiv.org Artificial Intelligence

--We present a deep reinforcement learning-based framework for autonomous microgrid management. Using deep reinforcement learning and time-series forecasting models, we optimize microgrid energy dispatch strategies to minimize costs and maximize the utilization of renewable energy sources such as solar and wind. Our approach integrates the transformer architecture for forecasting of renewable generation and a proximal-policy optimization (PPO) agent to make decisions in a simulated environment. Our experimental results demonstrate significant improvements in both energy efficiency and operational resilience when compared to traditional rule-based methods. This work contributes to advancing smart-grid technologies in pursuit of zero-carbon energy systems. We finally provide an open-source framework for simulating several microgrid environments.


Personalized Control for Lower Limb Prosthesis Using Kolmogorov-Arnold Networks

Mohasel, SeyedMojtaba, Aghaei, Alireza Afzal, Pew, Corey

arXiv.org Artificial Intelligence

Objective: This paper investigates the potential of learnable activation functions in Kolmogorov-Arnold Networks (KANs) for personalized control in a lower-limb prosthesis. In addition, user-specific vs. pooled training data is evaluated to improve machine learning (ML) and Deep Learning (DL) performance for turn intent prediction. Method: Inertial measurement unit (IMU) data from the shank were collected from five individuals with lower-limb amputation performing turning tasks in a laboratory setting. Ability to classify an upcoming turn was evaluated for Multilayer Perceptron (MLP), Kolmogorov-Arnold Network (KAN), convolutional neural network (CNN), and fractional Kolmogorov-Arnold Networks (FKAN). The comparison of MLP and KAN (for ML models) and FKAN and CNN (for DL models) assessed the effectiveness of learnable activation functions. Models were trained separately on user-specific and pooled data to evaluate the impact of training data on their performance. Results: Learnable activation functions in KAN and FKAN did not yield significant improvement compared to MLP and CNN, respectively. Training on user-specific data yielded superior results compared to pooled data for ML models ($p < 0.05$). In contrast, no significant difference was observed between user-specific and pooled training for DL models. Significance: These findings suggest that learnable activation functions may demonstrate distinct advantages in datasets involving more complex tasks and larger volumes. In addition, pooled training showed comparable performance to user-specific training in DL models, indicating that model training for prosthesis control can utilize data from multiple participants.


SIM-Shapley: A Stable and Computationally Efficient Approach to Shapley Value Approximation

Fan, Wangxuan, Li, Siqi, Zhou, Doudou, Okada, Yohei, Hong, Chuan, Liu, Molei, Liu, Nan

arXiv.org Machine Learning

Explainable artificial intelligence (XAI) is essential for trustworthy machine learning (ML), particularly in high-stakes domains such as healthcare and finance. Shapley value (SV) methods provide a principled framework for feature attribution in complex models but incur high computational costs, limiting their scalability in high-dimensional settings. We propose Stochastic Iterative Momentum for Shapley Value Approximation (SIM-Shapley), a stable and efficient SV approximation method inspired by stochastic optimization. We analyze variance theoretically, prove linear $Q$-convergence, and demonstrate improved empirical stability and low bias in practice on real-world datasets. In our numerical experiments, SIM-Shapley reduces computation time by up to 85% relative to state-of-the-art baselines while maintaining comparable feature attribution quality. Beyond feature attribution, our stochastic mini-batch iterative framework extends naturally to a broader class of sample average approximation problems, offering a new avenue for improving computational efficiency with stability guarantees. Code is publicly available at https://github.com/nliulab/SIM-Shapley.


Evaluating Alternative Training Interventions Using Personalized Computational Models of Learning

MacLellan, Christopher James, Stowers, Kimberly, Brady, Lisa

arXiv.org Artificial Intelligence

Evaluating different training interventions to determine which produce the best learning outcomes is one of the main challenges faced by instructional designers. Typically, these designers use A/B experiments to evaluate each intervention; however, it is costly and time consuming to run such studies. To address this issue, we explore how computational models of learning might support designers in reasoning causally about alternative interventions within a fractions tutor. We present an approach for automatically tuning models to specific individuals and show that personalized models make better predictions of students' behavior than generic ones. Next, we conduct simulations to generate counterfactual predictions of performance and learning for two students (high and low performing) in different versions of the fractions tutor. Our approach makes predictions that align with previous human findings, as well as testable predictions that might be evaluated with future human experiments.


Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada

Nejadgholi, Isar, Molamohammadi, Maryam, Bakhtawar, Samir

arXiv.org Artificial Intelligence

The non-profit settlement sector in Canada supports newcomers in achieving successful integration. This sector faces increasing operational pressures amidst rising immigration targets, which highlights a need for enhanced efficiency and innovation, potentially through reliable AI solutions. The ad-hoc use of general-purpose generative AI, such as ChatGPT, might become a common practice among newcomers and service providers to address this need. However, these tools are not tailored for the settlement domain and can have detrimental implications for immigrants and refugees. We explore the risks that these tools might pose on newcomers to first, warn against the unguarded use of generative AI, and second, to incentivize further research and development in creating AI literacy programs as well as customized LLMs that are aligned with the preferences of the impacted communities. Crucially, such technologies should be designed to integrate seamlessly into the existing workflow of the settlement sector, ensuring human oversight, trustworthiness, and accountability.


Application of Artificial Intelligence in Supporting Healthcare Professionals and Caregivers in Treatment of Autistic Children

Rouzbahani, Hossein Mohammadi, Karimipour, Hadis

arXiv.org Artificial Intelligence

Treatment plans often involve multiple neurodevelopmental condition marked by difficulties in social sessions with different therapists, and the absence of a standardized interaction, communication impediments, and repetitive behaviors. This fragmented approach continue to pose significant challenges due to the variability in can impede effective communication and coordination among symptomatology and the necessity for multidisciplinary care healthcare providers, adversely affecting the quality of care. This paper investigates the potential of Artificial Furthermore, parents and caregivers may find it challenging to access Intelligence (AI) to augment the capabilities of healthcare and manage the extensive records necessary for consistent treatment, professionals and caregivers in managing ASD. We have developed further complicating the overall management of ASD. a sophisticated algorithm designed to analyze facial and bodily Artificial Intelligence (AI) presents a promising solution to the expressions during daily activities of both autistic and non-autistic complexities involved in diagnosing and treating Autism Spectrum children, leading to the development of a powerful deep learningbased Disorder (ASD) [6]. AI-powered tools have the potential to autism detection system. Our study demonstrated that AI standardize the diagnostic process by analyzing extensive datasets to models, specifically the Xception and ResNet50V2 architectures, uncover patterns and correlations that might be overlooked by human achieved high accuracy in diagnosing Autism Spectrum Disorder evaluators.


BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM

Xu, Wenda, Li, Jiachen, Wang, William Yang, Li, Lei

arXiv.org Artificial Intelligence

Direct alignment from preferences (DAP) has emerged as a promising paradigm for aligning large language models (LLMs) to human desiderata from pre-collected, offline preference datasets. While recent studies indicate that existing offline DAP methods can directly benefit from online training samples, we highlight the need to develop specific online DAP algorithms to fully harness the power of online training. Specifically, we identify that the learned LLM should adhere to the proximity of the behavior LLM, which collects the training samples. To this end, we propose online Preference Optimization in proximity to the Behavior LLM (BPO), emphasizing the importance of constructing a proper trust region for LLM alignment. We conduct extensive experiments to validate the effectiveness and applicability of our approach by integrating it with various DAP methods, resulting in significant performance improvements across a wide range of tasks when training with the same amount of preference data. Even when only introducing one additional data collection phase, our online BPO improves its offline DAP baseline from 72.0% to 80.2% on TL;DR and from 82.2% to 89.1% on Anthropic Helpfulness in terms of win rate against human reference text.